Detection of Topic Change in IRC Chat Logs
نویسندگان
چکیده
We attack the problem of topic segmentation in the domain of Internet Relay Chat logs. In this process, we examine the previous work in text segmentation using a variety of methods. After considering the pros and cons of the methods, we employ Text Tiling, pause detection, and latent semantic analysis because they did not require the usage of large pre-tagged corpora. With these systems in place, we consider the properties and problems that exist when considering the domain of internet chat. To this end, we examine our results and show them to be fair at best.
منابع مشابه
Strangers in a Strange Land Interaction Management on Internet Relay Chat
This article examines a set of interactions (logs) takenfrom t h e f a n of computer-mediated communicntion known as Internet Relay Chat (IRC). The authors were particularly concerned with the interaction management strategies adopted by the participants in the logs during the opening and closing phases of the interactions to d m l o p interpersonal relationships and communicate socioemotional ...
متن کامل(Dis)agreements in Iranians’ Internet Relay Chats
The present study on politeness is an attempt to examine (dis)agreeing strategies utilized by EFL learners while chatting on the internet. Subjects of the study were forty male and thirty-three female Iranian natives whose internet relay chat (IRC) interactions, composed of 400 excerpts, were collected between December 2007 and September 2008. Data analysis was based on the general taxonomy of ...
متن کاملConcept drift detection in business process logs using deep learning
Process mining provides a bridge between process modeling and analysis on the one hand and data mining on the other hand. Process mining aims at discovering, monitoring, and improving real processes by extracting knowledge from event logs. However, as most business processes change over time (e.g. the effects of new legislation, seasonal effects and etc.), traditional process mining techniques ...
متن کاملAn Algorithm for Anomaly-based Botnet Detection
We present an anomaly-based algorithm for detecting IRC-based botnet meshes. The algorithm combines an IRC mesh detection component with a TCP scan detection heuristic called the TCP work weight. The IRC component produces two tuples, one for determining the IRC mesh based on IP channel names, and a sub-tuple which collects statistics (including the TCP work weight) on individual IRC hosts in c...
متن کامل1 Play , Art and Ritual on Irc ( Internet Relay Chat )
one of the world's most popular online chat modes. 1 Usually, IRC participants communicate via typed words. In contrast, this group communicates in real time mainly via the display of brilliantly colored visual images created from letters and other typographic symbols on the computer keyboard. Participants gather in a channel (chat room) called #mirc_rainbow, or " rainbow " for short. 2 While a...
متن کامل